On Foreign Name Search

Authors

  • Jason J. Soo
  • Ophir Frieder
Abstract

We address foreign name search in a highly diverse user community. User sophistication ranges from highly experienced archivists to apprehensive users who shy away from technology; apprehensive users dominate system use. Thus, all system interfaces must assume minimal dependency on the user. Our foreign name search approach, called Segments, is language-independent; thus, there is no need to determine the language of origin from the diverse candidate set of thirteen languages. We compare Segments against traditional n-gram and Soundex-based solutions. Actual and synthetic queries are used to search a names data set resident in the United States Holocaust Memorial Museum. We also search a subset of the 1990 United States Census Bureau Surnames data set to evaluate the performance of Segments on a predominantly language-specific (English) collection. Our results demonstrate statistically significant performance gains over both traditional approaches. The described approach supports search efforts at the United States Holocaust Memorial Museum.
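For readers unfamiliar with the two baselines named in the abstract, the following is a minimal Python sketch of a character n-gram similarity (Dice coefficient over padded bigrams) and of standard American Soundex. It does not show Segments, which the abstract does not describe. The function names, the `#` padding character, and the choice of the Dice coefficient are assumptions made for illustration and are not taken from the paper.

```python
def ngrams(name: str, n: int = 2) -> set[str]:
    """Return the set of character n-grams of a name, padded with '#'.

    Padding is an illustrative choice so that the first and last letters
    contribute their own n-grams.
    """
    pad = "#" * (n - 1)
    padded = f"{pad}{name.lower()}{pad}"
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def dice_similarity(a: str, b: str, n: int = 2) -> float:
    """Dice coefficient between the n-gram sets of two names (0.0 to 1.0)."""
    grams_a, grams_b = ngrams(a, n), ngrams(b, n)
    if not grams_a or not grams_b:
        return 0.0
    return 2 * len(grams_a & grams_b) / (len(grams_a) + len(grams_b))


# American Soundex digit classes; vowels, y, h, and w carry no digit.
_SOUNDEX_CODES = {
    ch: digit
    for letters, digit in (
        ("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
        ("l", "4"), ("mn", "5"), ("r", "6"),
    )
    for ch in letters
}


def soundex(name: str) -> str:
    """Return the four-character American Soundex code of an ASCII name."""
    letters = [c for c in name.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = _SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not break a run of identical codes
        code = _SOUNDEX_CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeat after it is kept
    return (result + "000")[:4]
```

Both baselines are tied to surface spelling or to English-oriented phonetic classes: `soundex("Robert")` and `soundex("Rupert")` collapse to the same code, while spelling variants that change the first letter never match under Soundex.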

Similar resources

Identifying Foreign Person Names in Chinese Text

Foreign name expressions written in Chinese characters are difficult to recognize since the sequence of characters represents the Chinese pronunciation of the name. This paper suggests that known English or German person names can reliably be identified on the basis of the similarity between the Chinese and the foreign pronunciation. In addition to locating a person name in the text and learnin...

Computational Techniques For Improved Name Search

This paper describes enhancements made to techniques currently used to search large databases of proper names. Improvements included use of a Hidden Markov Model (HMM) statistical classifier to identify the likely linguistic provenance of a surname, and application of language-specific rules to generate plausible spelling variations of names. These two components were incorporated into a protot...

What’s in a Name? Asymmetry of Foreign Branding Effects in Hedonic versus Utilitarian Product Categories

Foreign branding (spelling a brand name in a foreign language) is one way to create desirable product associations. In three empirical studies, we apply justification theory to demonstrate that foreign branding increases purchase likelihood if it is compatible with the product category (e.g., a French brand name for hedonic and a German name for utilitarian products). However, incongruence betw...

Exchange Rate Fluctuations, Consumer Demand, and Advertising: The Case of Internet Search

This paper addresses the question of how exchange rates affect consumer demand in markets where advertising plays an important role. We identify an effect that has not been emphasized in the existing literature: when foreign exchange rates appreciate, a foreign product becomes more expensive to domestic consumers, but at the same time, advertising becomes cheaper for the foreign advertiser. Thu...

Using Text Surrounding Method to Enhance Retrieval of Online Images by Google Search Engine

Purpose: The current research aimed to compare the effectiveness of various tags and codes for retrieving images from Google. Design/methodology: Selected images with different characteristics in a registered domain were carefully studied. The exception was that special conceptual features have been apportioned for each group of images separately. In this regard, each group image surr...

Publication date: 2010